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(57) Abstract 

A pre-coelenterazine peptide comprising a modified A. victoria GFP having an amino acid sequence in which Sex 6 * is replaced with 
Tyr. There are further provided a polynucleotide encoding the pre-coelenterazine peptide, allowing synthesis of large, pure amounts of 
coelenterazine, as by culturing organisms transformed with the polynucleotide; methods for synthesizing coeienterazine; and improved 
assays employing the polynucleotide or transformed organisms, eg., to detect mutagenesis. 
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BIOLUMINESCENT INDICATOR BASED UPON THE EXPRESSION OF A GENE FOR A 
MODIFIED GREEN-FLUORESCENT PROTEIN 

Field of the Invention 
This invention concerns a pre-peptide of coelenterazine which 
undergoes transformation to coelenterazine; a polynucleotide which encodes 
5 for the pre-peptide of coelenterazine; living organisms transformed with this 
polynucleotide; methods for synthesizing coelenterazine; and improved 
assays employing the polynucleotide or transformed organisms, e.g., to 
detect mutagenesis. 

10 BACKGROUND OF THE INVENTION 

For some years, it has been appreciated that bioluminescence offers 
a useful indicative tool in a variety of assays. Assays employing bio- 
luminescence enjoy the advantages of accuracy and great sensitivity. The 
accuracy results from the highly specific interaction between luciferase and 

15 its luciferin; consequently, the number of false positive indications is 
minimized. The sensitivity is due to the great sensitivity of light sensing 
^equipment and photomultipliers. When properly designed, such assays may 
additionally offer a quantitative relationship between the level of light 
released and the phenomenon being measured. 

20 

The general chemical reaction underlying the phenomenon of bio- 
luminescence is the oxidation of a substrate ("luciferin* 1 ) by an enzyme 
("luciferase"), usually in the presence of oxygen. An intermediate, ener- 
gized "oxyluciferinV' is formed during the oxidation reaction, which, in 
25 proceeding to the oxidized form ("oxyluciferin"), releases light. This inter- 
action between luciferin and luciferase is seen in the anthazoan coelenter- 
ates such as the sea pansy Renilla reniformis. 
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lt sh „u.d be noted that the terms 'luciferm- and "luclferase" are no. 
s pec,f,c: they are used In the literature to re.er to the blot—, 
s „s„a,e and enzyme o, nearly a„ b.o,um,nesc,ng organs. The spe^h 
, u ci«e„ns found In nature however vary extensively. For exemple. the luc, 

ma ,lne organisms including the iellyflsh and the sea pansy, re dearly 
Loot moleou.es. as shown In Formulae „ and ... respect^ 
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Formula 1 



COOH 



15 



20 



Formula II 
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Formula III 



25 



30 



Th ese three compounds are usual* termed flrefly WJ-J. 

,erln and coelenterate-type luolferln (or -coelenterazme . respectrvely 

avoid confusion. 

Generally, there Is little cross reaction between the luclferase of one 

p h y, u ; however, .e.g.. among coelenteratesl. -s reachons ^ o^ 

4 A 0 n,,nrea victoria coelenterazme with R. renitormis 
thusthecombinafonof^eQuoreawcfor ^ ^ 

,uciferase does generate biolum.nescence. Th.s 
coe.enterates have coeienterazine as their luciferin. 
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Recombinant DNA techniques have helped to elucidate biolumin- 
escent systems and thus have enabled investigators to introduce some 
bioluminescent compounds into assays to provide a convenient, conspic- 
uous indicator. For example, the firefly luciferase gene has been expressed 
5 in Escherichia coli, tuberculosis bacilli and tobacco plants grown in an 
ambient medium containing firefly luciferin. Also, genes for the bacterial lux 
system have been introduced into a number of non-bioluminescent species. 
These bacteria transformed with lux genes have been employed in a wide 
range of assays discussed in Gould and Subramani Anal.Bionh 175 . 5-13; 
10 Stewart and Williams, J.Gen.Microhinl. 138 1289-1300 (1992); Stewart 
and Williams, ASM News, 59., No. 5, 241-246 (1993); and Hill et al., 
Biotechn. App.Bioch- IX 3-14 (1993); all incorporated herein by reference. 

These assays are limited however by the requirement of most bio- 
15 luminescent systems that luciferin, a compatible luciferase and an exo- 
genously added co-factor all be present. In the absence of the substrate, 
enzyme or co-factor, the system does not bioluminesce. Thus, firefly and 
bacterial luciferases will only oxidize firefly luciferin and FMNH 2 if ATP or 
organic aldehydes are present respectively. And since most isolated genes 
20 to date have been for enzymatic luciferases - the biosynthetic pathway of 
luciferins remaining largely unclear - it has been necessary to grow 
organisms transformed with a luciferase gene in media containing the 
luciferin. 

25 This requirement that both luciferase and luciferin be present has 

limited non-bacterial bioluminescing compounds to assays where the cells 
under investigation are transformed with a luciferase and either have cell 
walls and plasma membranes which are permeable to a compatible luciferin 
or which are rendered permeable at some point. Thus, in Ow et al., Science 

30 234, 856-859 (1986), tobacco plants grown from cells transformed with 
the firefly luciferase gene were exposed to a liquid medium containing firefly 
luciferin. The plants exhibited bioluminescence primarily along their major 
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veins. Alternatively, lysis of E co// transformed with a firefly lucif erase 
gene was employed by Lee et aL, Anal.Chem. 64. 1755-1759 (1992), in- 
corporated herein by reference, to indicate mutagenesis. Upon induction of 
a lysogenic bacteriophage carried by the E. co/i, bacterial lysis released the 
5 firefly luciferase into a medium containing firefly luciferin. The level of 
bioluminescence in the bacterial growth medium indicated the level of muta- 
genesis. 

Because these assay systems relied upon cell walls or membranes 
10 which were permeable or had been made permeable to luciferin, they were 
limited to use either on living organisms having a particular cell wall or 
membrane permeability, or on lysed cells. Recently, however, the need for 
cell wall or membrane permeability or for cell lysis has been obviated in US 
Patent application Serial No. 08/119,678, filed September 10, 1993 by 
15 Chalfie et al. (hereinafter "Chalfie et al. M ) This application, which is 
incorporated herein by reference, describes the synthesis in £. coli and 
Caenorhabditis e/egans of the "green-fluorescent protein" (hereinafter 
"GFP") of the jellyfish A. victoria. 

20 GFP is a polypeptide derived from an apopeptide having 238 amino 

acid residues and a molecular weight of approximately 27,000. GFP 
contains a chromophore formed from amino acid residues 65 through 67. 
Investigators have proposed a mechanism for the formation of the GFP 
chromophore. In this proposed mechanism, Tyr 66 in GFP is dehydrogenated, 

25 and later. cyclizes along with its upstream neighbor Ser 66 , as well as its 
downstream neighbor Gly 07 to form the imidazole ring chromophore having 
the structure shown in Formula IV. 
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CH ^ 



o 
II 



H o NyN-CH^C-NH-CH-C-NH- 



I || I I 

N — CH — C - NH-CH CH 

H, ' H 3 C CH 




CH 2 OH 



3 



Cody et al., fiioch. 32 1212-1218 (1993). 

As its name indicates, GFP fluoresces; it does not bioluminesce. In 
1 5 vivo, the chromophore of GFP seems to be activated by energy transfer 
from coelenterazine complexed with the photoprotein aequorin, as in the 
hydrozoa order of coelenterates such as the jellyfish A. victoria. Organisms 
containing GFP thus exhibit green fluorescence at 510 nm, rather than the 
blue wavelength light at 480 nm typical of coelenterazine bioluminescence. 

20 

In the system of Chalfie et al., cells were transformed with a cDNA 
for the 238 amino acid apo-GFP. Expression of this apoprotein of GFP in 
the absence of other jellyfish gene products and was said to result in post- 
translational modification at residues 64 through 69 to form the GFP 

25 chromophore (of Formula IV above). The resulting GFP was said to exhibit 
the characteristic green fluorescence at 510 nm upon irradiation with blue 
or UV light. Thus, cells transformed with the cDNA for apo-GFP may be 
tested for GFP expression simply by irradiation with blue or UV light. No 
cell lysis is required to detect fluorescence; one need not provide a co-factor 

30 or a luciferin. 
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Unlike most bioluminescent systems, which require one or more co- 
factors or a luciferase enzyme in order to release light, GFP fluoresces when 
illuminated with certain wavelengths of light. Thus, organisms transformed 
with the GFP gene can exhibit fluorescence while alive and without lysis, 
5 unlike the firefly luciferin of Ow et al., supra. Further, the GFP gene may 
be operatively linked with a duplicate of a promoter controlling expression 
of a protein of interest; expression of this protein can therefore be mon- 
itored in a living cell by the detection of GFP's fluorescence. Moreover, 
transformation of cells of transparent organisms, such as C. elegans or 
10 zebra fish, permits one to determine the progeny of these cells as the 
organism develops. The GFP system thus provides a tool for detecting 
specific physiological events in vivo as well as for tracking expression of 
proteins of interest. 

15 Coelenterazine is a bioluminescing compound commonly found in 

organisms which synthesize GFP. For example, the jellyfish A. victoria 
produces both compounds. Unlike GFP, coelenterazine is a small non- 
proteinaceous, highly complex molecule. Coelenterazine (or 3,7-dihydro-2- 
methyl-6-{p-hydroxyphenyl)-8-benzylimidazo[1,2-alpyrazin-3-one,Horiand 

20 Cormier, Proc. Nat. Acad. Sci. 70 . No. 1 , 1 20-1 23 (1 973) releases blue light 
across a broad range peaking at 480 nm upon oxidation by luciferase in 
vitro. 

Coelenterazine exhibits bioluminescence at 480 nm in the presence 
25 of a compatible luciferase and oxygen; it does not require a co-factor. 
Despite this advantage, coelenterazine has not been widely adopted for use 
in assays, primarily due to the difficulty and expense of isolating significant 
amounts of coelenterazine. The compound and its luciferase are present in 
bioluminescing organisms at exceedingly low levels: forty thousand sea 
30 pansies (/?. reniformis) are required to collect 0.5 mg of coelenterazine, and 
six thousand sea pansies for a few mg of Renilla luciferase. "General 
Aspects of Bioluminescence," Ward, 321-358, at 344 in Chemi- and Bio- 
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luminescence, J.G. Burr, Ed., Marcel Dekker, Inc., 1985. Furthermore, 
these compounds are chemically unstable, having half lives in aqueous 
systems of one to two hours. While synthetic coelenterazine is available 
commercially (from London Diagnostics, Eden Prairie, Minnesota), the 
5 numerous complex organic reactions involved in its synthesis render it quite 
expensive. Recombinant DNA techniques to date have provided no assis- 
tance, since the natural route of coelenterazine biosynthesis, and indeed 
that of most luciferins, is unknown. 

1 0 Some investigators have theorized biosynthetic pathways for coelen- 

terazine. McCapra and Perring, 359-386, at 371, in Chemi- and Biolumin- 
escence , J. G. Burr, supra, noted some structural similarities between 
coelenterazine and the tripeptide tyrosyl-tyrosyl-phenylalanine, and 
expressed a belief that coelenterazine is derivable from this tripeptide. 

1 5 Elsewhere, McCapra alone pointed out structural similarity between Cyp- 
ridina luciferin (which shares the fused imidazopyrazine ring of coelen- 
terazine) and the tripeptide tryptophanyl-isoleucyl-arginine, and synthesized 
the Cypridina luciferin from a dehydrotripeptide in JCS Chem.Comm. 1972. 
"Cyclisation of a Dehydropeptide Derivative: a Model for Cypridina Luciferin 

20 Biosynthesis" 894-895. Observing a possible connection between a tri- 
peptide and luciferin, in FEBS Lett. 104. 1979, pp. 220-222, Shimomura 
stated: "Partial similarity between structure B [the proposed GFP chromo- 
phore] and the structure of coelenterazine may suggest a biogenetic 
significance." Ward similarly observed in Chemi- and Bioluminfiscftnrp J.G. 

25 Burr, supra, at p. 329 that in view of the chemical similarities between 
Cypridina luciferin and coelenterazine, it was intriguing to speculate that 
coelenterate-type luciferin and the GFP chromophore may be produced by 
a common biosynthetic mechanism involving post-translational protein 
modification (i.e., ring formation) followed by excision of the chromophore 

30 in the case of luciferin synthesis. 
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Despite the references to tripeptides in the biosynthesis of coelen- 
terazine, it has hitherto generally been believed by investigators that 
coelenterazine was not generated through ribosomal peptide synthesis. 
McCapra and Perring's observations above were directed to possible labora- 
5 tory synthesis routes for the luciferins discussed; for they flatly state that 
w [n]one of the chemical syntheses [of luciferins] are amino acid based...." 
McCapra and Perring, supra, at 371 . Rather, due to the compound's struc- 
tural complexity, investigators expected coelenterazine was synthesized 
nonribosomally, in the manner of K-glutamyl-cysteinylglycine ("glutathione", 

10 the natural antioxidant which forms 5-oxoproline in the K-glutamyl cycle) or 
gramicidin. Biochemistry . Voet and Voet, John Wiley & Sons, pp. 709-71 1 , 
941-942; and Biochemistry . Zubay, 2d Ed. f Macmillan Publishing Company, 
p. 796. Accordingly, it has been hypothesized that different organisms 
might possess a series of cooperating enzymes, for the synthesis of 

15 coelenterazine, J.W. Hastings, J.MoI.Evol. 19 , 309-321 (1983); and that 
coelenterazine synthesis occurs in vivo via a sequence of enzymatic or 
chemical reactions, McCapra and Perring, supra, at 375-376. 

The present invention resolves several difficulties in applications of 
20 a coelenterazine bioluminescent system. 

SUMMARY OF THE INVENTION 
Applicants have discovered that by modifying the cDNA for the apo- 
peptide of A. victoria GFP (described in U.S. Patent application Serial No. 
25 08/1 19,678, filed September 10, 1993), a heretofore unknown pre-coelen- 
terazine peptide is synthesized. Polynucleotides encoding this pre-peptide 
allow synthesis of large, pure amounts of coelenterazine and enable 
numerous methods for imparting bioluminescence to organisms under a 
variety of conditions. 



30 



The discovery that GFP may by slight modification be altered to a pre- 
coelenterazine peptide, and that polynucleotides encoding the apopeptide 
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for GFP could be modified to encode for the pre-coelenterazine peptide was 
wholly unforeseen. There existed no earlier indication that coelenterazine 
results from post-translational modification of any peptide; or that coelen- 
terazine is or could be genetically encoded. Nor was there any clear indica- 
5 tion that GFP and coelenterazine might share a common or related 
precursor. 

Moreover, the newly discovered steps believed to be involved in coel- 
enterazine formation from the pre-coelenterazine peptide are substantially 
1 0 different from those believed to take place in the post-translational modifica- 
tion of apo-GFP into GFP. Consequently, the steps involved converting apo- 
GFP into GFP did not foreshadow the dehydrogenation and cyclization 
believed to occur in the conversion of the pre-coelenterazine peptide to 
coelenterazine. 

15 

Without in any way limiting the invention, Applicants believe that pre- 
coelenterazine is transformed into coelenterazine by the dehydrogenation of 
Tyr 66 and/or its upstream neighbor Tyr 65 (which replaces the Ser 65 of GFP). 
Either one Tyr residue and one dehydroTyr residue (or both dehydroTyr 
20 residues) then cyclize with their further upstream neighbor Phe 84 . In this 
cyclization, the peptide bond between residues 63 and 64 and that between 
residues 66 and 67 are broken; cyclization thus brings about excision of 
coelenterazine from the peptide. 

25 Nothing in the Cody et al. mechanism of GFP chromophore formation 

indicates that the replacement of Ser 65 with Tyr 65 in the modified GFP would 
lead to dehydrogenation of both Tyr 66 and Tyr 65 . Nor is there any indication 
that this replacement would lead to an upstream shift of the residues which 
cyclize into a ring; or in two fused rings being formed in place of one: in 

30 GFP, residues 65, 66 and 67 cyclizing into a pyrazole ring, while in pre- 
coelenterazine, residues 64, 65 and 66 cyclize into a fused imadazopyrazine 
ring. 
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The following improvements are a result of Applicants' surprising 
discovery. The first embodiment of the invention is a pre-coelenterazine 
peptide comprising a modified A. victoria GFP in which R 65 is Tyr. Certain 
of these peptides comprise at least amino acid residues R 1 through R 228 of 
5 the modified A. victoria GFP. In the pre-coelenterazine peptides, several 
amino acid residues may vary; R 80 may be Gin or Arg, R 100 may be Phe or 
Tyr, R 108 may be Thr or Ser, R 141 may be Leu or Met, R m may be Glu or 
Lys, and R 219 may be Val or lie. Regardless of which of the above- 
identified amino acyl residues is present at residue 80, 100, 108, 141 , 172 

1 0 and 219, the resulting pre-coelenterazine peptide is formed, when generated 
in vivo or in a cell-free ribosomal system, to yield coelenterazine. One 
suitable pre-coelenterazine peptide has a modified amino acid sequence of 
GFP in which R 65 is Tyr, R 80 is Gin, R 100 is Phe, R 108 is Thr, R 141 is Leu, R 172 
is Glu, and R 219 is Val. In another suitable peptide, R 80 is Gin, R 100 is Tyr, 

15 R 108 is Ser, and R 141 is Met, R 172 is Glu, and R 219 is He. In further suitable 
pre-coelenterazine peptides, any of R 229 through R 238 may be omitted or 
replaced without detriment to the ability of the pre-coelenterazine to release 
coelenterazine. 

20 These peptides may be synthesized by a suitable method such as by 

exclusive solid phase techniques, by partial solid-phase techniques, by 
fragment condensation by classical solution phase synthesis, or by re- 
combinant DNA techniques. 

25 |n_a second embodiment, the invention provides polynucleotides, each 

of which comprises one or more sequences of nucleotide bases collectively 
encoding a modified amino acid sequence of a GFP of A. victoria comprising 
in which R 65 is Tyr. Certain of these polynucleotides include at least R 1 
through R 228 . In the polynucleotides of this embodiment, the nucleotides 

30 encoding for several amino residues may vary: R 80 may be Gin or Arg, R 100 
may be Phe or Tyr, R 108 may be Thr or Ser, R 141 may be Leu or Met, R 172 
may be Glu or Lys, and R 219 may be Val or He. 
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The one or more sequences of bases collectively encoding the pre- 
coelenterazine peptide in these polynucleotides may be uninterrupted by 
non-coding sequences. For example, the polynucleotide may suitably be a 
cDNA encoding for a modified GFP gene of A. victoria in which the nucleo- 
5 tides for amino acid 65 have been mutated from TCT to TAT. This 
polynucleotide is, in the present invention, denominated ofQ(C197A) to 
indicate the mutation in the cDNA of GFP at nucleotide 197 from C to A. 
Further, these polynucleotides may suitably include the incorporation of 
codons "preferred" for expression by selected mammalian or non- 
10 mammalian hosts. 

These polynucleotides may comprise further encoding sequences. 
Thus, one polynucleotide comprises, in addition to the sequence encoding 
the pre-coelenterazine peptide, one or more sequences of nucleotide bases 
1 5 collectively encoding the amino acid sequence of a luciferase compatible 
with coelenterazine. Another polynucleotide comprises, in addition to the 
pre-coelenterazine encoding sequence, one or more sequences of nucleotide 
bases collectively encoding the amino acid sequence of aequorin. 

20 When in an expression vector, all of the above polynucleotides may 

further comprise. 5' or 3' of the one or more polypeptide encoding 
sequences, one or more appropriate regulatory elements controlling 
expression of these sequences. Depending on the type of expression vector 
and regulatory element used, one regulatory element may be operatively 
25 linked to one or more than one encoding sequences. One expression vector 
comprises a polynucleotide comprising sequences of nucleotide bases 
collectively encoding a modified A victoria GFP wherein R 85 is Tyr and one 
or more sequences of nucleotide bases which encode at least one regulatory 
element operatively linked to the sequences encoding the pre-coelenterazine 
30 peptide. Suitably, the regulatory element is from a gene encoding other 
than GFP. 
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Suitably regulatory elements, which are well known to those skilled 
in the art, include promoters and enhancers. The regulatory elements are 
operatively linked to a polypeptide encoding sequence when they control, 
i.e., enable, modulate, activate or deactivate, directly or indirectly, the 
5 expression of these sequences. 

When in an expression vector, the polynucleotide carrying one or 
more appropriate regulatory elements may further carry one or more further 
sequences of bases which collectively confer resistance to an antibiotic, 
1 0 when the polynucleotide is expressed in an organism. One suitable expres- 
sion vector comprises gfa(C197A); another is plasmid TU#132. 

In one expression vector, the regulatory element is a promoter 
selected from the group consisting of promoters from a P450 gene, a pro- 
15 moter activated by a heavy metal, and a promoter from a gene encoding a 
stress protein. 

Another expression vector comprises, in addition to the sequence 
encoding the pre-coelenterazine peptide, one or more sequences of 

20 nucleotide bases collectively encoding a luciferase compatible with 
coelenterazine. This expression vector may have one or more sequences of 
nucleotide bases encoding a further regulatory element operatively linked to 
the sequences of nucleotide bases encoding said luciferase. If desired, the 
regulatory element operatively linked to the sequences encoding pre-coelen- 

25 terazine peptide may be the same as, or different from, the further regula- 
tory element operatively linked to said one or more sequences encoding 
luciferase. 

An expression vector comprising a sequence encoding pre-coelen- 
30 terazine peptide may also further comprise one or more sequences of 
nucleotide bases collectively encoding aequorin. 
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There are further embraced in this second embodiment, a 
polynucleotide comprising one or more sequences of nucleotide bases 
collectively encoding the amino acid sequence of R 1 through R 69 of said 
pre-coelenterazine peptide, as well as an oligonucleotide encoding the amino 
5 acid sequence of amino acid residues 64 through 69 of a pre-coelenterazine 
peptide. One such oligonucleotide has the nucleotide sequence TTC TAT 
TAT GGT GTT CAA. 

,o a third pmhodiment of the present invention, a polynucleotide com- 
1 0 prising one or more sequences of nucleotide bases collectively encoding the 
modified amino acid sequence of a GFP of A. victoria in which R 66 is Tyr, 
including those in which R 80 is Gin or Arg, R 100 is Phe or Tyr, R 108 is Thr or 
Ser, R 141 is Leu or Met, R 172 is Glu or Lys, and R 219 is Val or lie is intro- 
duced as exogenous polynucleotide material into an organism. In certain of 
1 5 these polynucleotides, the sequence of nucleotide bases collectively encode 
at least residues R 1 through R 228 of the modified GFP. 

Transformation of these cells may be performed by techniques well 
known to persons having skill in the art with appropriate expression vectors, 
20 e.g.. plasmid TU#1 32. Methods to introduce exogenous genetic material 
into a cell are well-known in the art. For example, exogenous DNA material 
may be introduced into the cell by calcium phosphate precipitation tech- 
nology. Other technologies, such as the retroviral vector technology, 
electroporation, lipofectiom and other viral vector systems such as adeno- 
■25 associated virus system, or microinjection may be used. For example, a 
bacteriophage carrying a polynucleotide encoding the pre-coelenteraz.ne 
peptide may be used to infect a particular type of bacteria. The infection 
may be subsequently detected by lysing said bacteria or its progeny in a 
medium containing a compatible luciferase. Accordingly, by using bacterio- 
30 phages modified to carry such a polynucleotide, the presence in a sample 
ol particular types of bacteria may be detected. Similarly, a eucarycot.c 
virus carrying the polynucleotide encoding the pre-coelenterazine peptide 

SUBSTITUTE SHEET (RULE 26) 



PCT/US95/01425 

WO 95/21191 

14 

may infect a specific cell type. This infection may also be easily detected 
by lysing said cells in a medium containing a compatible luciferase. 

Organisms into which these polynucleotides may be introduced 
5 include bacterial cells, yeast cells, fungal cells, insect cells, nematode cells, 
plant or animal cell. Suitable bacterial cells include £. coli BL21 (DE3)Lys 
S and E. coli BLR (DE3). 

All of these organisms may additionally be transformed with a second 
10 polynucleotide or expression vector comprising one or more sequences of 
nucleotide bases collectively encoding a luciferase compatible with coelen- 
terazine or collectively encoding aequorin. Squid giant neuron cells trans- 
formed with an expression vector encoding pre-coelenterazine peptide are 
suitable for transformation with an expression vector encoding aequorin. 

In a fourth embodiment , there is provided a method of synthesizing 
a peptide comprising the modified amino acid sequence of a GFP of A 
victoria including at least R 1 through R 228 in which R 65 is Tyr, including those 
in which, R 80 is Gin or Arg, R 100 is Phe or Tyr, R 108 is Thr or Ser, R 141 is Leu 
20 or Met, R 172 is Glu or Lys, and R 219 is Val or He. This method comprises 
incubating a polynucleotide comprising one or more sequences of nucleotide 
bases collectively encoding an amino acid sequence of such a peptide in the 
presence of means for effecting expression of the polynucleotide under 
conditions favorable for the expression of the polynucleotide. 

25 

The step of incubating the polynucleotide may be preceded by trans- 
forming an organism with the polynucleotide, and in which the means for 
effecting expression of the polynucleotide is the transformed organism. 

30 In one variant of this method, there is provided a method of 

synthesizing coelenterazine comprising synthesizing a pre-coelenterazine 
peptide according to the method in Claim 21, and isolating coelenterazine 
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from said means. This embodiment concerns an efficient method for 
expression of coelenterazine such that large amount thereof may be 
produced. Methods to collect or isolate coelenterazine are well-known and 
therefore, coelenterazine may be isolated easily. 

5 

Any of the organisms identified above is suitable; transformation may 
be by any procedures deemed appropriate by those skilled in the art. The 
step of incubating the polynucleotide may be performed by culturing the 
transformed organism for one or more generations under conditions favor- 
1 0 able to growth of the transformed organism and to expression of the poly- 
nucleotide, and the step of isolating coelenterazine may be performed by 
lysing the progeny of the cultured transformed organism to form a cell-free 
extract, and isolating coelenterazine from this extract. 

15 In this method, the means for effecting expression of said poly- 

nucleotide may be E. coli strain BL21(DE3)Lys S (Studier and Moffatt, 
i MaLBjgl 189 113 (1986), incorporated herein by reference) or E. coli BLR 
(DE3), transformed with an expression vector comprising, 5' or 3' of said 
one or more sequences of nucleotide bases collectively encoding the amino 

20 acid sequence of pre-coelenterazine peptide, one or more appropriate 
regulatory elements which collectively enable expression of said poly- 
nucleotide; and one or more sequences of bases which collectively confer 
resistance to an antibiotic upon an organism. One suitable transformed E. 
coli BL21 (DE3)Lys S is E. coli SMC2 (ATCC Accession No. 69553). 



25 



Alternatively, the means for effecting expression of said polynucleo- 
tide when it is a polyribonucleotide may be a cell-free aqueous translation 
system known to those skilled in the art. 

30 The method may further comprise the step of converting isolated 

coelenterazine to a stable form, luciferyl sulfate, as for example by incu- 
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bating isolated coelenterazine with a luciferin sulfokinase. The luciferin 
sulfokinase may suitably be derived from the organism R. ren/formis. 

This embodiment further includes purified coelenterazine and purified 
5 luciferyl sulfate synthesized by one of the above methods. 

A fifth embodiment provides assays employing coelenterazine bio- 
luminescence as an indicator. Several of these assays employ a further 
polynucleotide or expression vector having one or more sequences of 

10 nucleotide bases collectively encoding either the amino acid sequence of 
apo-aequorin or a luciferase compatible with coelenterazine; or an organism 
transformed with one of these polynucleotides. (A luciferase is compatible 
with coelenterazine if, when combined in an aqueous medium with coelen- 
terazine, it generates bioluminescence at or about 480 nm.) One such 

1 5 suitable luciferase is that isolated from the sea pansy R. reniformis and 
encoded in the polynucleotide disclosed in Lorenz et al., Proc. Nat. Acad. Sci. , 
88, 4438-4442, 1991, incorporated herein by reference. A suitable poly- 
nucleotide containing sequences which encode the apopeptide of aequorin 
is available commercially from Sealite Corp. of Atlanta, Georgia. 

20 

As with the above polynucleotides, the luciferase polynucleotide may 
further comprise appropriate regulatory elements and sequences conferring 
antibiotic resistance, and even the polynucleotide comprising sequences 
further may comprise one or more sequences of nucleotide base collectively 
25 encoding the amino acid sequence of the pre-coelenterazine peptide. 

Upon expression of the pre-coelenterazine peptide and luciferase 
genes, organisms which do not naturally bioluminesce at 480 nm exhibit 
bioluminescence at or about 480 nm. These assays may be employed for 
30 a variety of uses, as for example to detect the expression of certain genes 
or proteins of interest in cells; detecting increased levels of intracellular 
calcium ion; or detecting the presence of 0 2 in an anaerobic system. 
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The above-described cells and living organisms are useful to detect 
effects of external stimuli to the regulatory element. The stimuli may have 
direct or indirect effects on the regulatory element. Such effects will be 
detectable through either the induction of expression and production of the 
5 pre-coelenterazine peptide which, upon exposure to a compatible luciferase, 
results in bioluminescence; or through the switching off the expression of 
the pre-coelenterazine peptide. 

These cells and organisms may be used to detect the presence of 
10 certain molecules in various kinds of biological samples such as blood, urine 
or saliva. By operatively linking a regulatpry element which is affected by 
the molecule of interest to a polynucleotide sequence encoding the pre-coel- 
enterazine peptide, the presence of the molecules will affect the regulatory 
element which in turn will affect expression of the pre-coelenterazine 
1 5 peptide. Detection of these molecules may be used for diagnostic purposes. 
An example of such a molecule is a hormone. 

These assays may further be used to localize a protein of interest in 
a cell, both described in Chalfie et al., supra. More particularly, this 

20 embodiment provides a method for selecting cells expressing a protein of 
interest, or for detecting expression of a gene of interest. Thus, the method 
for selecting cells expressing a protein of interest comprises introducing into 
cells the polynucleotide encoding the pre-coelenterazine peptide, and a 
polynucleotide comprising one or more sequences of nucleotide bases col- 

25 lectively encoding said protein of interest. These cells are then cultured 
under conditions permitting expression of the pre-coelenterazine peptide and 
the protein of interest. The cells are then examined for expression of coel- 
enterazine; those which express coelenterazine are thereby selected cells 
expressing the protein of interest. 

30 

There are several conventional means by which one may identify cells 
expressing coelenterazine. One may plate out the cultured cells and grow 
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colonies from each. Samples taken from each colony may be cultured in 
turn, and lysed in a medium containing a luciferase compatible with coelen- 
terazine. The exhibition of bioluminescence in this medium confirms the 
expression of the pre-coelenterazine peptide, and thus of the protein of 
5 interest. Alternatively, one could also, at the time the cells are transformed 
with the polynucleotide encoding pre-coelenterazine peptide, further trans- 
form the cells with a polynucleotide comprising one or more sequences 
which collectively encode for a compatible luciferase. Cells exhibiting 
bioluminescence could then be isolated by means of well known to persons 
10 skilled in the art. 

The above cells and organisms are also useful in methods for detect- 
ing expression of a gene of interest. This method comprises introducing 
into a cell a polynucleotide comprising one or more sequences of nucleotide 

15 bases collectively encoding a regulatory element operatively linked to the 
gene of interest, as well as a polynucleotide which encodes for the pre- 
coelenterazine peptide, such that the regulatory element of the gene 
controls expression of pre-coelenterazine peptide, The cells are then 
cultured in conditions permitting expression of the gene of interest and of 

20 the pre-coelenterazine peptide. One then detects the expression of coelen- 
terazine in the cell by means well known to the art, thereby indicating the 
expression of the gene in the cell. 

A method for detecting increased levels of intracellular calcium ion 
25 comprises the steps of culturing an organism transformed with a poly- 
nucleotide comprising one or more sequences of nucleotide bases collec- 
tively encoding a pre-coelenterazine peptide, and a second polynucleotide 
comprising one or more sequences of nucleotide bases collectively encoding 
aequorin. The organism is cultured under conditions favorable to its growth 
30 and to expression of the pre-coelenterazine and aequorin peptides and 
monitored for exhibition of bioluminescence. The exhibition of biolumin- 
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escence by the cultured organisms indicates that intracellular levels of 
calcium ion have risen above normal cytoplasmic levels. 

Aequorin and coelenterazine complex into a heterotetramer in coelen- 
5 terates of the order hydrozoa; they also do so when synthesized in the 
transformed organism. The usual intracellular levels of calcium ion of 
approximately 10' 7 M are insufficient to trigger the heterotetramer form to 
exhibit bioluminescence. However, intracellular concentrations of calcium 
ion rise following opening of ion channels in theplasma membrane, damage 

10 to the cell membrane, or release of calcium ion into the cytoplasm from its 
intracellular repositories, the mitochondria or endoplasmic reticulum. 
Following these events, intracellular calcium ion concentrations may rise to 
levels of 10' 5 -10 3 M- These levels are more than adequate to trigger the 
heterotetramer to bioluminesce. Accordingly, this method permits one to 

1 5 monitor the frequency of calcium ion concentration increases and to evalu- 
ate physiological events which accompany these increases. Suitably the 
cell which is transformed is the squid giant neuron. 

This assay improves on the conventional assays employing intracellu- 
20 lar aequorin (described in Grynkiewicz G. et al., J.Biol.Chem. 260 3440-50 
(1985); Tsien R.Y. et al.. Trends Biochem.ScL 11 450 (1986); and Gilkey 
J. C. et aL, J. Cell. Biol. 76 448-466 (1978), all incorporated herein by refer- 
ence) because the level of aequorin and coelenterazine in the cell may be 
more accurately controlled by appropriate selection of the regulatory 
25 elements. 

A method for detecting the presence of 0 2 leaks into an anaerobic 
system comprises the steps of culturing an organism transformed with a 
polynucleotide comprising one or more sequences of nucleotide bases 
30 collectively encoding a pre-coelenterazine peptide, and a second poly- 
nucleotide comprising one or more sequences of nucleotide bases collec- 
tively encoding a compatible luciferase. The organism is cultured under 
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conditions favorable to growth of said transformed organism and favorable 
to expression of pre-coelenterazine and the luciferase, where the organism 
is a facultative anaerobe; and monitoring the culture for exhibition of bio- 
luminescence. Any exhibition of bioluminescence by the culture indicates 
5 that 0 2 has leaked into the anaerobic system. 

In a sixth embodiment , the invention provides an organism trans- 
formed with two polynucleotides. The first polynucleotide comprises one 
or more sequences of nucleotide bases collectively encoding a pre-coelen- 

1 0 terazine peptide comprising a modified A victoria GFP having an amino acid 
sequence in which R 65 is Tyr. The second polynucleotide comprises one or 
more sequences of nucleotide bases collectively encoding an amino acid 
sequence for a luciferase peptide compatible with coelenterazine. One of 
the polynucleotides has a mutation which precludes a bioluminescent inter- 

15 action between their expression products. The mutation is desirably 
reversible upon exposure of the transformed organism to a mutagen; 
reversal of the mutation enables a bioluminescent interaction between the 
two expression products. 

20 Any of the organisms identified above is suitable; transformation may 

be by any procedure deemed appropriate by those skilled in the art. The 
transformed organism may be employed in an assay to detect mutagenesis, 
as in a modified "Ames test." Ames et al., Proc.Nat.Acad.Sci., 70, 782- 
786 and 2281-2285 (1973), incorporated herein by reference. 

25 

This embodiment further provides a method of detecting mutagenesis 
caused by a chemical compound suspected of being a mutagen. The 
method comprises the steps of transforming a population of organisms with 
both of the first and second polynucleotides described above; growing a 
30 culture of said transformed organisms through one or more generations in 
a nutrient medium comprising said chemical compound; and measuring the 
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bioluminescence of said culture and comparing said bioluminescence to that 
from a culture of non-transformed mutagenized control organisms. 

DESCRIPTION OF THE DRAWING 
5 Figure 1 is the print out of a luminometry test of coelenterazine 

isolated from E. coli SMC2. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Abbreviations and Conventions. The usual conventions for indicating 
polypeptides (written with N-terminal to left, C-terminal to right) and poly* 
nucleotides (written 5' to left, 3' to right) are followed herein. The residues 
of the pre-coelenterazine peptide are numbered according to Prasher, et al. f 
Gene ILL 229-233, 1992, (incorporated herein by reference) beginning 
with R 1 at the N-terminal and proceeding sequentially toward the C-terminal. 

Any DNA disclosed as an individual single-stranded DNA also is 
considered to disclose the double-stranded DNA forming the same, as well 
as RNA equivalent thereto. 

20 Applicants state at several points herein that methods, techniques, 

organisms, and various means for carrying out identified procedures, all of 
which are well known in the art, may suitably be used. This statement is 
not to be interpreted that every possible alternative means is equally 
desirable and effective; the choice among these alternative techniques, 

25 organisms and combinations thereof is left to the skill and discretion of one 
skilled in the art. Conversely, Applicants' statements herein that certain 
specific techniques and organisms may suitably be used is not to be inter- 
preted that these specified techniques or organisms, or certain combinations 
thereof, are particularly preferred. The identification of these techniques 

30 and organisms is merely exemplary. 



10 



15 
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A. PRE-COELENTERAZINE PEPTIDE 

The primary structure of the pre-coelenterazine peptide is substan- 
tially based on that of the GFP of A. victoria appearing in Prasher et al., 
supra, except that Ser 65 appearing in the chromogenic sequence of amino 
5 acyl residues of apo-GFP, i.e., Phe^-Se^-Tyr^-Gly^-Val^-GIn 69 , is replaced 
by Tyr. 

Variations are seen in the amino acid sequence of wild A. victoria 
jellyfish GFP at several residues: R 80 may be Gin or Arg, R 100 may be Phe 

10 or Tyr, R 108 may be Thr or Ser, R 141 may be Leu or Met, R 172 may be Glu or 
Lys, and R 219 may be Val or lie. These same replacements may be made 
in the pre-coelenterazine without substantial prejudice. The length of the 
pre-coelenterazine peptide may also be subject to minor differences in 
length: i.e., the primary sequence of the peptide may slightly exceed or fall 

1 5 short of 238 amino acyl residues. 

This length variation may arise when the peptide is derived from a 
polypeptide having 5' or 3' termini to which "sticky end" nucleotide 
sequences have been added by procedures known to persons skilled in the 
20 art to facilitate insertion of the polynucleotide into an appropriate vector. 

For purposes of this disclosure, additional amino acid residues in the 
pre-coelenterazine peptide are indicated, not by altering the numbering of 
residues, but by considering the extra residues as one with either R 1 or R 238 . 
25 Thus, in the peptide of SEQ ID NO: 1 R 1 is Xaa is methionyl-alanine. 

Conversely, when one or more amino acid residues are deleted due 
to the introduction of restriction sites in the cDNA, the amino acid residues 
are numbered according to the number they would have held in GFP. Thus, 
30 if R 1 through R 3 were omitted, the N-terminal Gly would nevertheless be 
numbered R 4 in the truncated peptide. 
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All of these pre-coelenterazine peptides may be synthesized by a 
suitable method such as by exclusive solid phase techniques, by partial 
solid-phase techniques, by fragment condensation or by classical solution 
phase synthesis. For example, the techniques of exclusive solid-phase 
5 synthesis are set forth in the textbook "Solid Phase Peptide Synthesis", 
J.M. Stewart and J.D. Young, Pierce Chem! Company, Rockford, 111, 
1984 (2nd. ed.}, and M. Bodanszky, "Principles of Peptide Synthesis", 
SpringerVerlag, 1984. The peptides may suitably be prepared using solid 
phase synthesis, such as that generally described by Merrifield, 
10 J.Am.Chem.Soc 85, p. 2149 (1963), although other equivalent chemical 
syntheses known in the art may also be used as previously mentioned. 

Alternatively, each of these peptides may be made using recombinant 
DNA techniques. This may be done, for example by generating a poly- 

1 5 nucleotide which encodes, according to the genetic code of chromosomal 
DNA, the amino acid sequence of the desired pre-coelenterazine peptide. 
This polynucleotide, introduced into an expression vector, may be expressed 
in vitro or in vivo. The polynucleotide may be generated by procedures well 
known in the art, e.g., by DNA or RNA synthesis techniques and/or devices 

20 or by introducing one or more point mutations into the gene for GFP. Other 
suitable methods for this and other recombinant DNA techniques are 
discussed in Sambrook, Molecular Cloning: A laboratory manual . 2nd Ed., 
Cold Spring Harbor Laboratory Press (1989). 

25 The invention further concerns a peptide derived from the pre- 

coelenterazine peptide in which one or both R 65 and R 86 are dehydroTyr. 
This peptide may be generated by synthesizing the pre-coelenterazine 
peptide in vivo. 



30 B. POLYNUCLEOTIDE ENCODING THE PRE-COELENTERAZINE PEPTIDE 



In a second embodiment, the invention provides a polynucleotide 
comprising one or more sequences of nucleotide bases collectively encoding 
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the modified amino acid sequence of a GFP of A. victoria in which R 65 is 
Tyr. 

The polynucleotide may also be comprised of one or more sequences 
5 of nucleotide bases collectively encoding the modified amino acid sequence 
of a GFP of A. victoria in which R 65 is Tyr, and further, in which R 80 may be 
Gin or Arg, R 100 may be Phe or Tyr, R 108 may be Thr or Ser, R 141 may be Leu 
or Met, R 172 may be Glu or Lys, and R 219 may be Val or lie. Two suitable 
polynucleotides comprise one or more sequences of nucleotide bases collec- 
10 tively encoding for such a pre-coelenterazine in which R 80 is Gin, R 100 is Phe, 
R 108 is Thr, R 141 is Leu, R 172 is Glu, and R 219 is Val; or in which R 80 is Gin, 
R 100 is Tyr, R 108 is Ser, R 141 is Met, R 172 is Glu and R 219 is He. 

These polynucleotides may be composed of either DNA or RNA, and 
15 may be either single or double stranded. 

These polynucleotides may have, in the one or more sequences 
collectively encoding the pre-coelenterazine peptide, any nucleotide 
sequence which encodes one of the pre-coelenterazine peptides under the 

20 chromosomal genetic code. This code is degenerate; thus, the Arg at 
residue 109 may be encoded by nucleotide bases as CGT, but could, under 
the code, equally be CGC, CGA, CGG, AGA or AGG, and still encode for 
Arg. Similarly, nucleotide bases encoding Tyr as TAT could equally be TAC 
and still encode Tyr for residue 65 in the peptide. Alternatively, the 

25 polynucleotide may be a cDNA (or RNA equivalent) which includes, in 
addition to the nucleotides for Ser 65 being altered from TCT to TAT, 
mutations encoding for R 80 may be Gin or Arg, R 100 may be Phe or Tyr, R 108 
may be Thr or Ser, R 141 may be Leu or Met, R 172 may be Glu or Lys, and R 219 
may be Val or He. Thus, for example, the polynucleotide may include one 

30 or more of the following base mutations: the bases encoding Gin 80 may be 
altered from CAG (for Gin) to CGG (for Arg); the bases encoding Phe 100 may 
be altered from TTC to TAC (Tyr); bases for Thr 108 may be altered from 
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ACA to AG A (for Ser); bases for Leu 141 may be altered from CTG to ATG 
(for Met); bases for Glu 172 may be altered from GAA to AAA (for Lys). All 
these polynucleotides which encode for the pre-coelenterazine peptides are 
found, upon expression, to direct the synthesis of one of the pre-coelen- 
5 terazine peptides, and hence are embraced in the invention. 

To generate a polynucleotide having these encoding sequences, one 
may introduce one or more point mutations into the cDNA for the GFP of A. 
victoria in the cDNA described in Prasher, et al., supra, using in vitro 
10 mutagenesis methods well known to those skilled in the art. If desired, one 
may make further mutations to effect changes at the nucleotides encoding 
amino acid residues 80, 100, 108, 141 and 219 as desired. 

The abbreviation gfp.(C197A) is used herein to designate the cDNA 
15 sequence of SEQ ID NO: 2. Nucleotides 1 through 717 of gfp(C197A) 
encode the amino acid sequence of the pre-coelenterazine peptide, i.e., the 
modified A. victoria GFP of SEQ ID NO: 1 . The alteration of R 65 from Ser 
to Tyr is effected by the mutation of nucleotide 1 97 from C to A. The poly- 
deoxyribonucleotide sequence of gfpJC197A) appears in SEQ ID N0:2, 
20 where bases 1 through 717 are encoding bases; the mutated base 197 is 
A; and the triplet codon formed by bases 718 through 720 form the stop 
cod on. 

All of the above polynucleotides may further comprise, 5' or 3' of 
25 said one or more sequences of bases, one or more appropriate regulatory 
elements which collectively enable expression of said one or more 
sequences of bases encoding said pre-coelenterazine peptide. 

Suitably, the regulatory element may be a promoter. Suitable 
30 promoter elements include a promoter activated by heavy metal (e.g. the 
one described in Freedman, et al. J. Biological Chemistry . 268: 2554, 1 993, 
incorporated herein by reference); a P450 promoter (e.g. the cytochrome 
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P450); or a promoter for a stress protein, (e.g., described in Stringham, et. 
al., Molecular Bioloov of the Cell , 3: 221 , 1 992), one of said stress proteins 
being a heat-shock protein. Other suitable promoters include that of the 
arabinose operon (phi 80 dara) or the colicin E1, galactose, alkaline phos- 
5 phatase or tryptophan operons. Similarly the ADH system may be employed 
to provide expression in yeast. Alternatively, the regulatory element may 
be an enhancer. 

The regulatory elements are operatively linked with the polypeptide 
1 0 comprising one or more sequences of nucleotide bases collectively encoding 
an amino acid sequence of a pre-coelenterazine peptide; i.e., the regulatory 
elements are placed on the polynucleotide 5' or 3' of the one or more 
sequences suitable to enable expression of the sequences. 

1 5 Polynucleotides which bear one or more of such regulatory elements 

may be used in transforming organisms, as when suitably the polynucleotide 
is included in an expression vector. The regulatory elements are selected 
for compatibility with the organism into which the polynucleotide is to be 
incorporated by transformation, i.e., the regulatory elements are those 

20 which may be recognized by the transformed organism or cell and which 
will aid in controlling the expression of said polynucleotide in the 
transformed organism. 

Thus when the organism to be transformed is E. coli, the regulatory 
25 element may be a promoter (e.g., the T7, the SP6 or lac promoter); or 
transcription initiation sequences for ribosome binding (e.g. the Shine- 
Delgarno sequence and the start codon AUG). When the organism to be 
transformed is eucaryotic, the regulatory elements may include a hetero- 
logous or homologous promoter for RNA polymerase II and/or a start codon 
30 AUG. For example, when the target of transformation is a mammalian cell, 
the regulatory element may be a promoter (e.g. the SP 40 or the bovine 
papilloma virus promoter). Suitable regulatory elements for use in other 
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microbe, or in animal, or plant cells may be selected according to criteria 
well known to persons having skill in the art. All of these regulatory 
elements may be obtained commercially (individually or incorporated into a 
vector) or assembled by methods well known in the art. 

5 

Where the polynucleotide carries one or more appropriate regulatory 
elements, there may further be present one or more further sequences of 
bases which collectively confer resistance to an antibiotic when the 
polynucleotide is expressed in an organism. Such genes for antibiotic 
10 resistance are desirable components for expression vectors since they 
facilitate identification of transformed cells grown in the presence of 
antibiotics, and exert a continual pressure on the transformed organisms to 
retain and express the expression vectors. One polynucleotide suitable for 
use in transforming bacteria E. coli 'xs plasmid TU#132. 

15 

All of the above polynucleotides may be synthesized by known 
methods. The polynucleotide may be generated by procedures well known 
in the art, e.g., by DNA or RNA synthesis techniques and/or devices or by 
introducing one or more point mutations into the gene for GFP. Other 
20 suitable methods for this and other recombinant DNA techniques are 
discussed in Sambrook, supra. 

Thus one may use a DNA synthesizing device to construct the entire 
polynucleotide or to synthesize several fragments of a polynucleotide and 
25 ligate these together. This is a laborious process for polynucleotides, and 
thus it is usually preferable to generate polynucleotides by other means 
known to those skilled in the art. 

The embodiment further includes a polynucleotide comprising one or 
30 more sequences of nucleotide bases collectively encoding the amino acid 
sequence of R 1 through R 69 of the pre-coelenterazine peptide, as well as an 
oligonucleotide comprising nucleotide bases encoding the sequence of 
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amino acid residues 64 through 69 of the pre-coelenterazine peptide. One 
such oligonucleotide has the nucleotide sequence TTC TAT TAT GGT GTT 
CAA. Such poly- and oligonucleotides are useful as probes for homologous 
sequences of DNA or RNA. The attachment of a chemical label to such 
5 probes facilitates locating the probes in a test system. Suitable labels 
include radioisotopes (e.g. 32 P, 3 *S, 126 l), fluorescent compounds, or other 
well known labels (e.g. biotin) covalently linked to the poly- or 
oligonucleotide. 

1 0 Although probes are normally used with a detectable label that allows 

easy identification, these poly- and oligonucleotides are also useful in 
unlabeled form, both as precursors of labeled probes and for use in methods 
that provide for direct detection of double-stranded DNA or DNA/RNA. 

15 C. ORGANIS MS TRANSFORMED WITH POLYNUCLFQTIDE ENCODING PRE- 
COELENTRAZINE PEPTIDE 

A third embodiment of the present invention is an organism trans- 
formed with a polynucleotide comprising one or more sequences of nucleo- 
tide bases collectively encoding the modified amino acid sequence of a GFP 

20 of A. victoria in which R 65 is Tyr, R 80 is Gin or Arg, R 100 is Phe or Tyr, R 108 
is Thr or Ser, R 141 is Leu or Met, R 172 is Glu or Lys, and R 219 is Val or lie. 
Organisms which may suitably be transformed with such polynucleotides 
include bacterial cells, yeast cells, fungal cells, insect cells, nematode cells, 
plant or animal cell. Suitable animal cells include, but are not limited to 

25 Vero cells, HeLa cells. Cos cells, CV1 cells and various primary mammalian 
cells. 

Transformation of these cells may be performed by techniques well 
known to persons having skill in the art. Thus, for instance, transformation 
30 of yeast and plant cells must be preceded by treatment of the cells to 
remove the rigid cell wall, as by treatment with a digestive enzyme; the 
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resulting spheroplasts and chloroplasts readily take up polynucleotide 
plasmids and upon return to growth medium regenerate their cell walls. 

One suitable transformed organism is E. coli SMC2, an E. coli of the 
5 Strain BL21 (DE3)Lys S which has been transformed with Plasmid TU#1 32. 
E. coli SMC2 and plasmid TU#132 were deposited on February 4, 1994 
with the American Type Culture Collection (ATCC), 12301 Parklawn Drive, 
Rockville, MD 20852, USA, under the provision of the Budapest Treaty for 
the International Recognition of the Deposit of Microorganisms for the 
10 Purposes of Patent Procedure and Title 37 Section 1.801 et seq. of the 
Code of Federal Regulations, and accorded ATCC Accession Nos. 69553 
and 75666 respectively. 

It is noted that the deposited material is not considered to be essen- 
1 5 tial to the practice of the claimed invention and that the grant of admission 
to the depository to distribute samples of the biological material does not 
constitute an express or implied license to practice the invention claimed in 
any patent issuing from the instant application or from any continuation, 
divisional or reissue application thereof. 

20 

Another suitable cell is E. coli strain BLR (DE3) (A. Roca, University 
of Wisconsin, cited in the Novogen Catalogue). When this strain is trans- 
formed with an expression vector comprising a polynucleotide having one 
or more sequences of nucleotide bases collectively encoding a pre-coelen- 
25 terazine peptide, the resulting strain produces coelenterazine more stably. 
The stability of this production is believed to be due to the reduced recom- 
bination of the host. E coli strain BLR (DE3) may suitably be transformed 
with an expression vector based on pET1 1 . 
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D. METHOD OF SYNTHESIZING PRE-COELENTERAZINE AND 
COELENTERAZINE 

The fourth embodiment of the present invention comprises a method 
of synthesizing a pre-coelenterazine peptide comprising a modified A 
5 victoria GFP having an amino acid sequence in which R 65 is Tyr. The 
method comprises incubating a polynucleotide comprising one or more 
sequences of nucleotide bases collectively encoding an amino acid sequence 
of such a peptide in the presence of means for effecting expression of the 
polynucleotide under conditions favorable for the expression . of the 
10 polynucleotide. These means may be an in vitro transcription/translation 
system or an organism transformed with the polynucleotide. 

In this method, the means for effecting expression of the poly- 
nucleotide may be either an in vitro cell-free translation system or an 
1 5 organism which has been transformed with the polynucleotide, being viable 
and in a medium containing assimilable sources of carbon, nitrogen, and 
inorganic substances. One suitable means for effecting expression in vivo 
is E. coli SMC2. 

When the means for effecting expression is such a transformed 
organism, the polynucleotide may comprise one or more appropriate 
regulatory elements, and one or more sequences of bases which collectively 
confer resistance to an antibiotic upon the transformed organism. One 
suitable polynucleotide is Plasmid TU#132. 

There is further provided a method of synthesizing coelenterazine 
comprising synthesizing the pre-coelenterazine peptide according to the 
above method and isolating coelenterazine from the means for effecting 
expression of the polynucleotide. This embodiment provides an efficient 
method for expression of coelenterazine such that large amounts of the 
compound may be produced. Methods to isolate expressed protein have 
been well-known and therefore, coelenterazine may be isolated easily. 
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In this method, the means for effecting expression may be an 
organism capable of expressing the polynucleotide, i.e., one transformed or 
transformable with the polynucleotide when subjected to conditions favor- 
able to transformation in the presence of the nucleotide. The transformed 
5 organism and the transformable organism (once the latter is transformed) 
is cultured for one or more generations under conditions favorable to growth 
of said organism and to expression of the polynucleotide. In this method, 
the step of isolating coelenterazine is performed by lysing the progeny of 
said cultured transformed cells to form a cell-free extract, and isolating 
10 coelenterazine from said extract. 

To maximize expression of the pre-coelenterazine peptide, the 
sequence flanking the translation initiation codon may be modified (reviewed 
by Kozak, 1 984), compilation and analysis of sequences upstream from the 
15 translation start site in eucaryotic mRNA's Nucl. Acids Res. 12:857-872, 
incorporated herein by reference. A sequence may then be generated to 
produce higher amounts of the pre-coelenterazine peptide. In addition, 
artificial introns may be introduced so as to increase the production of the 
protein. 

20 

The transformed cell selected to express the polynucleotide encoding 
the pre-coelenterazine peptide also affects the level of expression. 
Expression may also be boosted by employing said method, as a means for 
effecting expression of the polynucleotide, a cell selected from the group 
25 consisting of £ coli SMC2 (ATCC Accession No. 69553) and £ coli BLR 
(DE3) transformed with the pET3a expression vector described above. 
These cells are suitably cultured as described in Example III below, 
optionally in the presence NADP. 

30 The conditions of growth also may be modified in order to raise the 

level of pre-coelenterazine produced. When the transformed cells are 
cultured at 30°C and in the absence of IPTG until they reach log growth 
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phase. At this stage, when a large number of cells are present, IPTG 
(suitably 0.5mM_to 1mM) is added to induce expression of the polynucleo- 
tide encoding the pre-coelenterazine peptide. After inducing for a suitable 
time, the cells are harvested, and coelenterazine may be collected. 

5 

Coelenterazine is a highly labile substance in aqueous based media 
with a half-life of one to two hours. When it is suspended in methanolic 
HCI, however it is stable, even at room temperature. Accordingly, the step 
of isolating coelenterazine from said extract may be performed by adding 
10 methanolic HCI to the cell lysate, mixing and removing the suspended cell 
solids. 

It is well known that coelenterazine may be stabilized in aqueous 
media by being modified from its keto form (Formula III) to its enol form in 
1 5 luciferyl sulfate. The enol sulfate form of coelenterazine has conventionally 
been termed luciferyl sulfate and this term is employed herein. Luciferyl 
sulfate has the structure shown in Formula V: 

Formula V o 3 so^ ^-^-oh 

20 




This modification to luciferyl sulfate is carried out by incubating 
25 coelenterazine with the enzyme luciferin sulfokinase and 3'.5'-diphospho- 
adenosine. The conversion of luciferyl sulfate to luciferin is 3,5-diphospho- 
adenosine-linked. This may be accomplished by incubating the isolated 
coelenterazine with a compatible luciferin sulfokinase. suitably a luciferyl 
sulfokinase derived from a coelenterate such as A reniformis. Methods well 
30 known in the art may be used to isolate and purify luciferyl sulfokinase, 
e.g.. methods described in Cormier et al.. ,),Cell.Physi Q l. 81. No. 2. 291- 
297 (1973), incorporated herein by reference; Hori et al., Biochim. 
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Bioph Y s.Actg 256: 638-644 (1972), incorporated herein by reference; and 
Karkhanis and Cormier, BjficjL 10:31 7-326 (1971), incorporated herein by 
reference. Luciferin sulfokinase may be isolated from this organism 
according to the procedures described in Cormier et al., Bioch. 9 1184- 
5 1189, 1970, incorporated herein by reference. 

Purification of the coelenterazine from the lysate may be accom- 
plished using chromatography procedures well known in the art. Such 
procedures include size exclusion chromatography, column chromatography 
0 and high performance HPLC using one or more reverse phase HPLC pro- 
cedures. Combinations of such chromatography methods may also be 
employed. In this manner coelenterazine is obtained in purified form. 

Coelenterazine synthesized by these methods may be characterized 
5 by one or more of the following methods: HPLC, emission spectroscopy, 
and mass spectroscopy. Performance of these tests upon coelenterazine 
synthesized according to the above methods demonstrates that the coelen- 
terazine has the same chromatographic profile as natural coelenterazine; 
that the synthetic coelenterazine emits blue light at 480 nm; and that it has 
0 a mass spectroscopy profile nearly identical to that of natural coelenter- 
azine. 

Accordingly, in a further embodiment of the invention, there is 
provided purified coelenterazine and luciferyl sulfate made by the above 
5 methods. The production process of these compounds is carried out in a 
conventional manner. Transformed bacterial cells are resuspended in a 
suitable known buffer solution, followed by lysing the bacterial cells in a 
conventional manner such as ultrasonic wave treatment and/or enzyme 
treatment, and obtaining the supernatant by means of centrifugation. 
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E. IMPROVED ASSAY FOR DETECTING MUTAGENESIS 

In another embodiment, the invention provides an organism trans- 
formed with two polynucleotides, one comprising one or more sequences 
of nucleotide bases collectively encoding an amino acid sequence for a 
5 luciferase peptide compatible with coelenterazine. The second poly- 
nucleotide comprises one or more sequences of bases collectively encoding 
the amino acid sequence of said pre-coelenterazine peptide. 

One of these polynucleotides has a mutation which prevents its 
10 expression product from having a bioluminescent interaction with any 
coelenterazine. This mutation, which may suitably be an insertion, a 
duplication, a translocation, mis-sense, or a reading shift mutation, is 
desirably reversible upon exposure to a mutagen. Thus, should exposure to 
a mutagen reverse the mutation in the second polynucleotide, the organism 
1 5 will express the first and second polynucleotides, thus generating coelen- 
terazine and an active luciferase compatible therewith. 

Since reversal of the mutation results in bioluminescence, the 
organism may be employed in a mutagenesis assay. Therefore, the embodi- 

20 ment further provides a method of testing the mutagenicity of a chemical 
compound, comprising: a) transforming a population of organisms with said 
first and said second polynucleotides; b) growing a culture of said 
transformed organisms through one or more generations in a nutrient 
medium comprising said chemical compound; and c) measuring the bio- 

25 luminescence of said culture and comparing said bioluminescence to that 
from a culture of non-transformed mutagenized control organisms. 

In this method, the rate of mutagenesis may be measured instru- 
mental^, as by subjecting said mutagenized cultures to on-line luminometry , 
30 avoiding the Ames test steps of preparation of agar plates and tedium of 
scoring bacterial colonies on the plates. 
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In order to halt the growth of non-revertant organisms while 
performing this method, and to prevent "crowding out" of revertant growth 
by non-revertants, the expression of coelenterazine and luciferase may be 
linked to mutant survival. This may suitably be done by placing the 
5 mutation on a regulatory element controlling the expression of the trans- 
formed organisms' antibiotic resistance or the synthesis of an obligatory co- 
factor. Suitable locations for such a mutation include the promotor region 
or the region of repressor binding. 

10 The invention now being fully described, it will be apparent to one of 

ordinary skill in the art that many changes and modifications may be made 
thereto without departing from the spirit or scope of the invention as set 
forth herein. 

1 5 Finally, this invention provides a method for producing fluorescent 

molecular weight markers comprising: a) linking a DNA molecule encoding 
a green fluorescent protein with a DNA molecule encoding a known amino 
acid sequence in the same reading frame; b) introducing the linked DNA 
molecule of step a) in an expression system permitting the expression of a 

20 fluorescent protein encoded by the linked DNA molecule; and c) deter- 
mining the molecular weight of the expressed fluorescent protein of step b) f 
thereby producing a fluorescent molecular weight marker. 

Various expression systems are known in the art. The E. coli 
25 expression system, one of the commonly used systems is described in the 
following section. 

The determination of molecular weight may be done by comparing the 
expressed fluorescent protein of step b) with known molecular weight 
30 markers. Alternatively, the molecular weight can be predicted by calculation 
since the linked DNA sequence is known (and so is the amino acid sequence 
being encoded). In an embodiment, the expressed fluorescent protein is 
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purified. The purified fluorescent protein can be conveniently used as 
molecular weight markers. 

This invention will be better understood from the Experimental Details 
5 which follow. However, one skilled in the art will readily appreciate that the 
specific methods and results discussed are merely illustrative of the 
invention as described more fully in the claims which follow thereafter. 

HPLC . A Bio-Rad ODS-5S reverse-phase HPLC column (4x250 mm) 
10 may be used. High-performance liquid chromatography may be carried out 
on Waters 600 multisolvent system with Waters 490 programmable wave- 
length detector and Waters 740 data module. Column eluent is monitored 
at three wavelengths simultaneously so as to detect coelenterazine and UV 
absorption contamination in real time. All HPLC runs are performed in acidic 
1 5 methanolic buffer solutions, so the coelenterazine is always in the 370 nm- 
absorbing form. 

Photometric Determination. Bioluminescence may be measured and 
peak light intensities determined, with a luminometer. Bioluminescence 
20 intensity is converted to quanta per second by calibrating the instrument 
relative to a radioactive 14 C light standard that emits maximally in the 410 
nm region. Routine assays for coelenterazine are performed by rapidly 
injecting 10//I of clarified E. coli SMC2 cell extract in methanolic-HCI into a 
vial containing Renilla luciferase in 1 ml of luciferase buffer. 

25 

Analytical Spectra. A Cary 1 7-D recording spectrophotometer and a 
Bausch and Lomb Spectronic 2000 are used interchangeably for fixed- 
wavelength absorbance measurements or for spectral scans. 

30 Mass Spectroscopy. Coelenterazine isolated from recombinant 

bacteria may be analyzed by electrospray ionization mass spectrometry and 
liquid secondary ion mass spectrometry followed by mass spectrometry/ 

SUBSTITUTE SHEET (RULE 26) 



WO 95/21191 



PCMJS95/01425 



37 

mass spectrometry. This will provide molecular weight and structural 
information on the isolated coelenterazine. These two tools also provide 
information on the purity of the cyclic tripeptide as well as of other 
peptides, if present. 

5 

EXAMPLE I 
In vitro MUTAGENESIS 
TU#58 (described in US Patent Application Serial No 08/1 19,678) is 
treated with Ncol and EcoRI to generate a fragment of the GFP gene. This 
1 0 fragment is replicated by PGR with an oligomeric primer to insert the C1 97A 
mutation. The fragment is also treated with primers (Ncol at 5', T3 at 3') 
to incorporate restriction sites. The primer which incorporates the C197A 
mutation has the sequence: 

15 CCT GTT CCA TGG CCA ACA CTT GTC ACT ACT TTC TAT TAT G 

The "A" base located five bases upstream of the 3' terminal G base end 
constitutes the C197A mutation. 

20 The replicated fragment containing these mutations is then hybridized 

and ligated to TU#58 which has been treated with Ncol and EcoRI to 
produce plasmid TU#132. Sequencing of plasmid TU#132 confirms the 
incorporation of both endonuclease sites and of the C197A mutation. 

25 EXAMPLE II 

In vitro MUTAGENESIS 

An alternative method of generating a point mutation to that set out 
in Example I is as follows. A synthetic oligonucleotide is used to introduce 
the mutation. The synthetic oligonucleotide may be synthesized employing 
30 a commercially available automatic DNA synthesis apparatus, the product 
being subject to end-phosphorylation in a conventional manner to obtain a 
primer. 
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The oligomer is hybridized to the single strand plasmid TU#132 
disclosed in US Patent Application Serial No. 08/119,678 under conditions 
of low stringency, and subjected to three-stage treatment: treatment at 
100° C. for 5 minutes, followed by allowing the resulting material to stand 
5 at 30° for 3 minutes and further at 4° C. for 30 minutes to carry out 
annealing and reacting dXTP (X = G, A, T> C) with Klenow fragment (E. coli 
polymerase) in the presence of T4-ligase to prepare a duplex chain. 

EXAMPLE III 

10 TRANSFORMATION 

E. coli of strain BL21 (DE3)Lys S is transformed with plasmid TU#1 32. 
Transformants are cultured at room temperature and selected on media 
containing ampicillin (100//g/ml) and IPTG (0.8mM). Plasmid DNA is 
isolated from the transformants and analyzed by automated sequence 
15 analysis, which confirms the presence of the C197A mutation. 

EXAMPLE IV 

MEASUREMENT OF COELENTERAZINE SYNTHESIZED IN E. coli SMC2 
The bioluminescence of coelenterazine synthesized by E. coli SMC2 

20 cultivated as in Example III is measured on a custom built luminometer. The 
circuitry of this luminometer is modeled after that described in Blinks et al., 
Methods in Enzvmol. . 5_Z 292-328, 1 978, incorporated herein by reference; 
the reaction chamber and shutter assembly of the luminometer are modeled 
after that described in Levine and Ward, Cnmn.Bioch.Phvsiol. J2B, 77-85 

25 1982, incorporated herein by reference. 

The column is equilibrated with a starting buffer of 0.1 % trifloracetic 
acid at a flow rate of 1mL/min. Coelenterazine-containing samples are 
injected onto the column in starting buffer. Five minutes after sample 
30 injection, a linear methanol gradient (+1% methanol/min, flow rate 
1mL/min) is initiated until all components in the mixture are eluted. 
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Following each HPLC run, the column is rinsed with 100% methanol. Under 
these conditions, coelenterazine elutes at 90% methanol. 

Peak light intensities of the coelenterazine are determined on this 
5 luminometer and bioluminescence intensity is converted to quanta per 
second by calibrating the instrument relative to a radioactive 14 C light 
standard that emits maximally in the 410 nm region. 

A paste of £. coli SMC2 is lysed by sonication in absolute methanol 
10 acidified to 1N with HCI. 10//I of clarified E coli SMC2 cell extract in 
methanolic-HCI is rapidly injected into a vial containing Renilla luciferase in 
1 ml of luciferase buffer. The coelenterazine luciferase assay buffer 
described in Matthews et al., EKoctL 16 85-91, (1977) incorporated herein 
by reference is prepared. 50 jj\ of pure luciferase (prepared as described in 
1 5 Matthews et al., supra) is added to the vial. Corrected emission spectra are 
collected on an on-line computerized fluorimeter. 

The luminometer gives a reading of 1.5 x 10 7 hv/sec upon addition 
of luciferase, against a background of 4 x 10 5 hv/sec. The read-out from 
20 this instrument appears in Fig. 1 . 

EXAMPLE V 

£. coli BLR (DE3) is grown in plates under anaerobic conditions in a 
Gas-Pak container according to the instructions of the manufacturer (Becton 
25 Dickinson Microbiology Systems). Colony growth is slowed, due, it is 
believed, to the anaerobic conditions. The resulting colonies do not 
detectably exhibit bioluminescence after at least 3 days of growth under 
anaerobic conditions. However, after being exposed to air for 24 hours, the 
colonies do begin to exhibit bioluminescence. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANTS: Ward, William 

Chalfie, Martin 

(ii) TITLE OF INVENTION: BIOLUMINESCENT INDICATOR FOR GENE 
10 EXPRESSION AND DETECTION OF MUTAGENESIS BASED UPON THE 

EXPRESSION OF A GENE FOR A MODIFIED GREEN- FLUORESCENT 
PROTEIN 



15 



(iii) NUMBER OF SEQUENCES: 5 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Omri M. Behr, Esq. 

(B) STREET: 325 Pierson Avenue 

(C) CITY: Edison 

20 (D) STATE: New Jersey 

(E) COUNTRY: USA 

(F) ZIP: 08837 

(v) COMPUTER READABLE FORM: 
25 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

30 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/192,158 

(B) FILING DATE: 04-FEB-1994 

(C) CLASSIFICATION: 

35 (viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Behr, Omri M. 

(B) REGISTRATION NUMBER: 22,940 

(C) REFERENCE /DOCKET NUMBER: RUTG3. 0-017 

40 (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (908) 494-5240 

(B) TELEFAX: (908) 494-0428 

(C) TELEX: 51 1642 BEPATEDIN 



45 



(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ix) FEATURE: 
55 (A) NAME/KEY: Protein 

(B) LOCATION: one-of(l) 

(D) OTHER INFORMATION: /note= -Residue 1 Xaa = 
Methionyl-alanine" 

60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Xaa Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 

65 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 
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Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
5 50 55 60 

Tyr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 

TO His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

85 90 95 



15 



Thr lie Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Ser- Arg Ala Glu Val 

100 105 no 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 

115 120 " 125 



Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Met Glu Tyr Asn 
20 130 135 140 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

25 He Lys Val Asn Phe Lys He Arg His Asn lie Glu Asp Gly Ser Val 

165 170 175 



30 



Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 



Lys Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu Glu Phe Val 
35 210 215 220 

Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

40 ( 2 ) INFORMATION FOR SEQ ID NO: 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 944 bases 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ATGGCTAGCA AAGGAGAAGA ACTTTTCACT GGAGTTGTCC CAATTCTTGT TGAATTAGAT 60 
GGTGATGTTA ATGGGCACAA ATTTTCTGTC AGTGGAGAGG GTGAAGGTGA TGCAACATAC 120 
GGAAAACTTA CCCTTAAATT TATTTGCACT ACTGGAAAAC TACCTGTTCC ATGGCCAACA 180 
60 CTTGTCACTA CTTTCTATTA TGGTGTTCAA TGCTTTTCAA GATACCCAGA TCATATGAAA 240 
CAGCATGACT TTTTCAAGAG TGCCATGCCC GAAGGTTATG TACAGGAAAG AACTATATTT 300 
TTCAAAGATG ACGGGAACTA CAAGACACGT GCTGAAGTCA AGTTTGAAGG TGATACCCTT 360 

65 

GTTAATAGAA TCGAGTTAAA AGGTATTGAT TTTAAAGAAG ATGGAAACAT TCTTGGACAC 420 
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AAATTGGAAT ACAACTATAA CTCACACAAT GTATACATCA TGGCAGACAA ACAAAAGAAT 480 
GGAATCAAAG TTAACTTCAA AATTAGACAC AACATTGAAG ATGGAAGCGT TCAACTAGCA 540 
GACCATTATC AACAAAATAC TCCAATTGGC GATGGCCCTG TCCTTTTACC AGACAACCAT 600 
TACCTGTCCA CACAATCTGC CCTTTCGAAA GATCCCAACG AAAAGAGAGA CCACATGGTC 660 
CTTCTTGAGT TTGTAACAGC TGCTGGGATT ACACATGGCA TGGATGAACT ATACAAATAA 720 
ATGTCCAGAC TTCCAATTGA CACTAAAGTG TCCGAACAAT TACTAAAATC TCAGGGTTCC 780 
TGGTTAAATT CAGGCTGAGA TATTATTTAT ATATTTATAG ATTCATTAAA ATTGTATGAA 840 
15 TAATTTATTG ATGTTATTGA TAGAGGTTAT TTTCTTATTA AACAGGCTAC TTGGAGTGTA 900 
TTCTTAATTC TATATTAATT ACAATTTGAT TTGACTTGCT CAAA 944 
(2) INFORMATION FOR SEQ ID NO: 3: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
25 (D) TOPOLOGY: linear 



30 



45 



50 



60 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TTCTATTATG GTGTTCAA 18 
35 (2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 bases 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: both 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CCTGTTCCAT GGCCAACACT TGTCACTACT TTCTATTATG 40 
(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
55 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 i 



Phe Ser Tyr Gly Val Gin 
65 1 5 
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WE CLAIM: 

1 . A pre-coelenterazine peptide comprising a modified Aequorea 
victoria GFP wherein R 65 is Tyr. 

5 

2. The pre-coelenterazine peptide of Claim 1 comprising at least 
amino acid residues R 1 through R 228 of said modified A. victoria GFP. 

3. The pre-coelenterazine peptide of Claim 1 comprising amino 
10 acid residues R 1 through R 238 of said A. victoria GFP, wherein R 80 is Gin or 

Arg. R 100 is Phe or Tyr, R 108 is Thr or Ser, R 141 is Leu or Met, R 172 is Glu or 
Lys, and R 219 is Val or lie. 

4. The pre-coelenterazine peptide of Claim 3 wherein R 80 is Gin, 
15 R ,0 ° is Phe, R 108 is Thr, R 1 * 1 is Leu, R 172 is Glu and R 219 is Val. 

5. The pre-coelenterazine peptide of Claim 3 wherein R 80 is Gin, 
R ,0 ° is Tyr, R 108 is Ser, R 141 is Met, R 172 is Glu and R 219 is He. 

20 6. The pre-coelenterazine peptide of Claim 4 wherein R 1 is 

methionyl-alanine. 

7. A polynucleotide comprising one or more sequences of nucleo- 
tide bases collectively encoding the pre-coelenterazine peptide of Claim 1 . 

25 

8. A polynucleotide comprising one or more sequences of nucleo- 
tide bases collectively encoding the pre-coelenterazine peptide of Claim 4. 

9. The polynucleotide of Claim 8 comprising the cDNA poly- 
30 nucleotide gfpJC 1 97 A) . 
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10. The polynucleotide of Claim 7 further comprising one or more 
sequences of nucleotide bases collectively encoding the amino acid 
sequence of a luciferase compatible with coelenterazine. 

5 11. The polynucleotide of Claim 7 further comprising one or more 

sequences of nucleotide bases collectively encoding the amino acid 
sequence of apo-aequorin. 

12. An expression vector comprising the polynucleotide of Claim 
1 0 7, and further comprising one or more sequences of nucleotide bases which 

encode at least one regulatory element operatively linked to said one or 
more sequences encoding said pre-coelenterazine peptide. 

13. An expression vector of Claim 12 further comprising one or 
1 5 more sequences of nucleotide bases which collectively confer resistance to 

an antibiotic upon an organism transformed therewith. 

14. The expression vector of Claim 12 comprising the nucleotide 
sequence of plasmid TU#132 (ATCC Accession No. 75666). 

20 

15. The expression vector of Claim 12 wherein said regulatory 
element is a promoter selected from the group consisting of promoters from 
a P450 gene, a promoter activated by a heavy metal, and a promoter from 
a gene encoding a stress protein. 

25 

16. The expression vector of Claim 12 further comprising one or 
more sequences of nucleotide bases collectively encoding a luciferase 
compatible with coelenterazine. 

30 17. The expression vector of Claim 1 6 further comprising one or 

more sequences of nucleotide bases encoding a further regulatory element 
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operatively linked to said one or more sequences of nucleotide bases 
encoding said luciferase. 

18. The expression vector of Claim 17 wherein said at least one 
5 regulatory element operatively linked to said one or more sequences 
encoding said pre-coelenterazine peptide is the same as said further 
regulatory element operatively linked to said one or more sequences 
encoding luciferase. 

10 19. The expression vector of Claim 17 wherein said at least one 

regulatory element operatively linked to said one or more sequences encod- 
ing said pre-coelenterazine peptide differs from said further regulatory 
element operatively linked to said one or more sequences encoding lucifer- 
ase. 

15 

20. The expression vector of Claim 1 2 further comprising one or 
more sequences of nucleotide bases collectively encoding apo-aequorin. 

21 . An organism transformed with the polynucleotide of Claim 1 2. 

20 

22. The organism of Claim 21 which is an animal, bacterial, plant 
or insect cell. 

23. The animal cell of Claim 22 which is selected from the group 
25 consisting of invertebrate, vertebrate and mammalian cells. 

24. An organism transformed with the expression vector of Claim 

14. 

30 25. The organism of Claim 24 selected from the group consisting 

of E. co// BLR (DE3) and £. co//SMC2 (ATCC Accession No. 69553). 
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26. An organism of Claim 21 further transformed with a second 
polynucleotide comprising one or more sequences of nucleotide bases 
collectively encoding a luciferase compatible with coelenterazine. 

5 27. An organism transformed with the expression vector of Claim 

16. 

28- An organism of Claim 21 further transformed with a second 
polynucleotide comprising one or more sequences of nucleotide bases 
10 collectively encoding apo-aequorin. 

29. An organism transformed with the expression vector of Claim 

20. 

15 30. The organism of Claim 29 which is a squid giant neuron. 

31. A method of expressing the polynucleotide of Claim 7 com- 
prising incubating said polynucleotide in the presence of means for effecting 
expression of said polynucleotide under conditions favorable to expression 

20 of said polynucleotide. 

32. A method of synthesizing coelenterazine comprising expressing 
said polynucleotide according to Claim 3 1 and collecting coelenterazine from 
said means. 

25 

33. The method of Claim 32, wherein said step of incubating said 
polynucleotide is preceded by transforming said organism with said poly- 
nucleotide; said means for effecting expression of said polynucleotide is an 
organism transformed with said polynucleotide; said step of incubating said 

30 polynucleotide in the presence of said means comprises culturing said 
transformed organism for one or more generations under conditions favor- 
able to growth of said transformed organism and favorable to expression of 
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said polynucleotide; and said step of collecting coelenterazine is performed 
by lysing the progeny of said cultured transformed organism to form a cell- 
free extract. 

5 34. The method of Claim 33 wherein said means for effecting 

expression of said polynucleotide is a cell selected from the group consisting 
of £. co/i SMC2 (ATCC Accession No. 69553) and E. coli BLR (DE3) 
transformed with an expression vector comprising said polynucleotide. 

10 35. The method of Claim 33, wherein said organism is cultured 

aerobically. 

36. The method of Claim 33 wherein said transformed organism is 
cultured in the presence NADP. 

15 

37. The method of Claim 31 wherein said polynucleotide is a poly- 
ribonucleotide, and said means for effecting expression of said polyribo- 
nucleotide is a cell-free aqueous translation system. 

20 38. The method of Claim 30 further comprising converting said 

collected coelenterazine to luciferyl sulfate. 

39. The method of Claim 38, wherein said converting is performed 
by incubation of said coelenterazine with a luciferin sulfokinase. 

25 

40. Purified coelenterazine synthesized by the method of Claim 32. 

41 . Purified luciferyl sulfate synthesized by the method of Claim 

38. 

30 

42. A method for selecting cells expressing a protein of interest, 
wherein said cells comprise a polynucleotide comprising one or more 
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sequences of nucleotide bases collectively encoding said protein of interest 
and further comprising a regulatory element operatively linked to said 
encoding sequences, said method comprising 

a) transforming said cells with the expression vector of Claim 10; 
5 b) culturing said cells under conditions permitting expression of said 

pre-coelenterazine peptide and the protein of interest; and 

c) selecting the cultured cells which express coelenterazine, thereby 
selecting cells expressing the protein of interest. 

10 43. A method for detecting expression of a gene of interest in a 

cell which comprises: 

a) introducing into a cell a polynucleotide comprising one or more 
sequences of nucleotide bases collectively encoding a regulatory element 
and said gene of interest, and a polynucleotide of Claim 5, such that the 

1 5 regulatory element of the gene controls expression of pre-coelenterazine 
peptide; 

b) culturing said cell in conditions permitting expression of the gene 
of interest and of said pre-coelenterazine peptide; and 

c) detecting the expression of coelenterazine in the cell, thereby 
20 indicating the expression of the gene in the cell. 

44. A method for detecting increased levels of intracellular calcium 
ion, comprising 

a) culturing an organism of Claim 28 under conditions favorable to 
25 growth of said transformed organism and favorable to expression of said 

pre-coelenterazine and apo-aequorin peptides; and 

b) monitoring said culture for exhibition of bioluminescence. 

45. A method for detecting the presence of 0 2 in an anaerobic 
30 system, comprising 

a) culturing an organism of Claim 26 under conditions favorable to 
growth of said transformed organism and favorable to expression of said 
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pre-coelenterazine and luciferase peptides, where said organism is a 
facultative anaerobe; and 

b) monitoring said culture for exhibition of bioluminescence. 

5 46. The organism of Claim 26 wherein said second polynucleotide 

comprises, in its one or more sequences of nucleotide bases, a mutation 
which precludes a bioluminescent interaction between an expression 
product of said second polynucleotide with an expression product of said 
first polynucleotide, said mutation being reversible upon exposure to a 
10 mutagen to enable a bioluminescent interaction between said expression 
products. 

47. A method of testing the mutagenicity of a chemical compound, 
comprising: 

1 5 a) growing a culture of said organism of Claim 41 through one or 

more generations in a nutrient medium comprising said chemical compound; 
and 

b) measuring the bioluminescence of said culture and comparing said 
bioluminescence to that from a culture of said organisms of Claim 41 grown 
20 in the absence of said chemical compound. 
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